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1. Real Party in Interest 

The real party in interest is INTERNATIONAL BUSINESS MACHINES 
CORPORATION, the assignee of the entire right, title and interest in and to the subject 
application by virtue of an assignment of record. 

2. Related Appeals and Interferences 

None. 

3. Status of Claims 

Claims 1, 3-16, 18-30 and 32-42 are pending, stand rejected and are under appeal. 
A copy of the claims 1,3-16, 18-30 and 32-42 as pending is presented in the 
Appendix. 

4. Status of Amendments 

Claims 1, 3-16, 18-30 and 32-42 were not amended after Final Rejection. 

5. Summary of Claimed Subject Matter 

• A method for profiling computer program executions in a computer processing 
system having a processor and a memory hierarchy is claimed in claim 1 . The method 
includes the following steps. A computer program is executed. (Specification , p. 19, line 
7; p. 20, line 17). Profile counts are stored in a memory array. The profile counts are for a 
plurality of events associated with the execution of the computer program. The memory 
array is separate and distinct from the memory hierarchy so as to not perturb normal 
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operations of the memory hierarchy, (Specification , p. 14, lines 8-13; Figure 1, profile 
matrix 100; Figure 5, profile matrix 100). At least one of the plurality of events is selected 
for profiling. (Specification , p. 19, lines 11-14; Figure 3, step 304; Figure 4, step 404). 
The profile counts are updated for only the selected events. ( Specification , p. 19, lines 20- 
21; p. 17, lines 15-22). Compilation and optimization of the computer program is assisted, 
based upon the selected profile counts stored in the memory array. (Specification , p. 22, 
line 22-p. 26, line 14; p. 28, line 19-p.29, line 8). 

An apparatus for profiling computer program executions in a computer processing 
system having a processor and a memory hierarchy is claimed in claim 23. The apparatus 
includes the following. A memory array is provided that is adapted to store profile counts 
for events associated with execution of the computer program. The memory array is 
separate and distinct from the memory hierarchy so as to not perturb normal operations of 
the memory hierarchy. ( Specification, p. 14, lines 8-13; Figure 1, profile matrix 100; 
Figure 5, profile matrix 100). A controller is provided that is adapted to select the events 
for profiling and to update the profile counts of the selected events stored in said memory 
array. ( Specification , p. 19, lines 1 1-23; Figure 3, steps, 304, 310; Figure 4, step 404, 
410). A scaling circuit is provided that is adapted to scale the profile counts to prevent 
profile information overflow. (Specification , p. 15, lines 14-24). The computer processing 
system assists compilation of the computer program, based upon the profile counts stored 
in the memory array. ( Specification , p. 22, line 22-p. 26, line 14; p. 28, line 19-p.29, line 
8). 

A method for profiling computer program executions in a computer processing 
system having a processor and a memory hierarchy is claimed in claim 40. The method 
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includes the following steps. A computer program is executed. (Specification , p. 19, line 
7; p. 20, line 17). Event-specific profile counts are stored in a memory array. Each profile 
count is associated with an event associated with the execution of a path of the computer 
program. The memory array is separate and distinct from the memory hierarchy so as to 
not perturb normal operations of the memory hierarchy. (Specification, p. 14, lines 8-13; 
Figure 1, profile matrix 100; Figure 5, profile matrix 100). At least one of the plurality of 
event-specific profile counts is selected is selected for profiling the path of the computer 
program. (Specification , p. 19, lines 1 1-14; Figure 3, step 304; Figure 4, step 404). If at 
least one of the selected event-specific profile counts has exceeded a predefined threshold, 
the portions of the computer program associated with the event-specific profile counts are 
optimized more aggressively than other portions of the computer program. ( Specification , 
p. 22, lines 2-8; p. 22, line 22-p. 26, line 14; p. 28, line 19-p.29, line 8). 

6. Grounds of Rejection to be Reviewed on Appeal 

A. Claims 1, 4-8, 11-13, 16, 22, 38 stand rejected under 35 U.S.C. § 103(a) as 
being unpatentable over Krishnaswamy et al. (U.S. Patent No. 6.622,300) (hereinafter 
" Krishnaswamy "). 

B. Claims 3, 9-10, 23-30, 32-34, 37 and 39 stand rejected under 35 U.S.C. § 
103(a) as being unpatentable over Krishnaswamy in view of "Dictionary of Computing" 
(hereinafter " Dictionary "). 

C. Claims 40 and 42 stand rejected under 35 U.S.C. § 103(a) as being unpatentable 
over Krishnaswamy in view of Chang et al. "Using Profile Information to Assist Classic 
Code Optimizations" (hereinafter " Chang "). 
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7. Argument 

A. Introduction 

In rejecting claims under 35 U.S.C. § 103, the Examiner bears the burden of 
presenting a prima facie case of obviousness. In re Rijckaert, 9 F.3d 1531, 1532 (Fed. Cir. 
1993). The burden of presenting a prima facie case of obviousness is only satisfied by 
showing some objective teaching in the prior art or that knowledge generally available to 
one of ordinary skill in the art would lead that individual to combine the relevant teachings 
of the references. In re Fine, 837 F.2d 1071, 1074 (Fed. Cir. 1988). A prima facie case of 
obviousness is established when the teachings of the prior art itself would appear to have 
suggested the claimed subject matter to one of ordinary skill in the art. In re Bell, 991 F.2d 
781, 782 (Fed. Cir. 1993). The suggestion to combine the references should come from the 
prior art, and the Examiner cannot use hindsight gleaned from the invention itself to pick 
and choose among related disclosures in the prior art to arrive at the claimed invention. In 
re Fine, 837 F.2d at 1075. If the Examiner fails to establish a prima facie case, the 
rejection is improper and must be overturned. In re Rijckaert, 9 F.3d at 1532 (citing In re 
Fine, 837F.2dat 1074). 

It is respectfully submitted that at the very least, the reference of Krishnaswamv is 
legally insufficient to sustain a prima facie case of obviousness against independent claim 
1 . Further, it is respectfully submitted that at the very least, the references of 
Krishnaswamv and Dictionary , taken individually or in any combination, are legally 
insufficient to sustain a prima facie case of obviousness against independent claim 23. 
Additionally, it is respectfully submitted that at the very least, the references of 
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Krishnaswamv and Chang , taken individually or in any combination, are legally 
insufficient to sustain a prima facie case of obviousness against independent claim 40. 

For the reasons set forth below, Appellants respectfully request that the claim 
rejections under 35 U.S.C. § 103(a) be reversed. 

B. Claims 1, 4-8, 11-13, 16, 22, 38 stand rejected as being unpatentable over 
Krishnaswamv. 

(i). Krishnaswamv fails to teach or suggest "selecting at least one of 
the plurality of events for profiling," as claimed in claim 1. 

Claim 1 claims, inter alia, "selecting at least one of the plurality of events for 
profiling." The Examiner admits on page 3 of the Final Office Action (Paper no. 1 12904) 
that "Krishnaswamy's background did not explicitly state selected events." The Examiner 
then cites col. 6, lines 21-30 of Krishnaswamv as demonstrating "selecting at least one of 
the purality of 'events for profiling. In the Advisory Action mailed on April 6, 2005, the 
Examiner specifically points to the word "programmable" in the phrase "programmable to 
count events like," as disclosed in col. lines 24-28 of Krishaswamy . The Examiner seems 
to assume that the statement "programmable to count events like" necessarily teaches or 
suggests "selecting at least one of the plurality of events for profiling." The term 
"programmable" does not necessarily imply a particular function. In particular, the term 
"programmable" does not imply that a selection of events is necessarily accomplished. For 
example, the counters for Krishnaswamv can be programmed to count events A, B and C. 
However, that the counters can be programmed to count events A, B and C does not 
necessarily imply that only A can be selected, or that only A and B can be selected, or any 
of a variety of combinations for that matter. Basically, the Examiner is contending that the 
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word "programmable" can be broadly interpreted to mean any function under the sun. 
Clearly, such a misunderstanding is without merit, facially unreasonable, and cannot be 
allowed. 

Because Krishnaswamv neither teaches nor suggests each and every element of 
claim 1, it is respectfully asserted that no prima facie case of obviousness has been made 
out. Accordingly, the rejection of claims 1, 3-16, 18-22 and 38 should be reversed. 

(ii) . Krishnaswamv fails to teach or suggest "updating the profile 
counts for only the selected events" and "assisting compilation and 
optimization of the computer program, based upon the selected profile 
counts stored in the memory array," as claimed in claim 1, 

Because Krishnaswamv does not teach or suggest "selecting at least one of the 
plurality of events for profiling," as showin in part (B)(i) above, it logically follows that 
Krishaswamy does not disclose "updating the profile counts for only the selected events" 
and "assisting compilation and optimization of the computer program, based upon the 
selected profile counts stored in the memory array." 

Because Krishnaswamv neither teaches nor suggests each and every element of 
claim 1, it is respectfully asserted that no prima facie case of obviousness has been made 
out. Accordingly, the rejection of claims 1,3-16, 18-22 and 38 should be reversed. 

(iii) . Krishnaswamv fails to teach or suggest "storing, in a memory array, 
profile counts for a plurality of events associated with the execution of the 
computer program, the memory array being separate and distinct from 
the memory hierarchy so as to not perturb normal operations of the 
memory hierarchy," as claimed in claim 1. 

The Examiner incorrectly contends that counters of the PMU 90 in Krishnaswamv 
teach or suggest the "memory array," as claimed by Appellants. Such an assertion is 
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untenable and incorrect in the view of those skilled in the art. Although the Examiner is 
allowed a broad interpretation of the claim, the Examiner apparently forgets that the 
interpretation must be reasonable. No one skilled in the art would confuse the terms 
"memory" and "counters," as the Examiner contends. 

A "memory" (and "memory array") as used in the general terminology of computer 
architecture is distinguishable from a "counter." Common examples of memory include 
SRAM and DRAM. On the other hand, a "counter" is built with latches or flip flops, 
which are "clocked elements." It should further be noted that while memory arrays are 
addressed, counters cannot be addressed, in contradiction to the description of "memory" 
in the patent application and also as claimed in claim 33. 

Because Krishnaswamv neither teaches nor suggests each and every element of 
claim 1 j it is respectfully asserted that no prima facie case of obviousness has been made 
out. Accordingly, the rejection of claims 1, 3-16, 18-22 and 38 should be reversed. 

C. Claims 3, 9-10, 23-30, 32-34. 37 and 39 stand rejected under 35 U.S.C. S 
103(a) as being unpatentable over Krishnaswamv in view of Dictionary. 

(i). The combination of Krishnaswamv and Dictionary fails to teach 

or suggest "a controller adapted to select the events for profiling/' as 
claimed in claim 23, 

The Examiner applies the same rejection applied for "selecting at least one of the 
plurality of events for profiling," as claimed in claim 1, for "a controller adapted to select 
the events for profiling," as claimed in claim 23. Appellants submit that the arguments 
provided in section (B)(i) above apply similarly to this section. In particular, that the 
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counters in Krishnaswamv are "programmable" does not necessarily imply that a selection 
of the events is made. 

Because the combination Krishnaswamv and Dictionary neither teaches nor 
suggests each and every element of claim 23, it is respectfully asserted that no prima facie 
case of obviousness has been made out. Accordingly, the rejection of claims 23-30 and 
32-39 should be reversed. 

(ii). The combination of Krishnaswamv and Dictionary fails to teach 
or .suggest "a controller adapted... to update the profile counts of the 
selected events stored in said memory array" and "wherein the computer 
processing system assists compilation of the computer program, based 
upon the profile counts stored in the memory array," as claimed in claim 
23. 

Because the combination of Krishnaswamv and Dictionary does not teach or 
suggest "selecting at least one of the plurality of events for profiling," as showin in part 
(C)(i) above, it logically follows that the combination of Krishaswamv and Dictionary does 
not disclose "a controller adapted. . .to update the profile counts of the selected events 
stored in said memory array" and 4C wherein the computer processing system assists 
compilation of the computer program, based upon the profile counts stored in the memory 
array." 

Because the combination of Krishnaswamv and Dictionary neither teaches nor 
suggests each and every element of claim 23, it is respectfully asserted that no prima facie 
case of obviousness has been made out. Accordingly, the rejection of claims 23-30 and 
32-39 should be reversed. 
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(iii). The combination of Krishnaswamv and Dictionary fails to teach or 
suggest "a scaling circuit adapted to scale the profile counts to prevent 
profile information overflow/' as claimed in claim 23 and essentially as 
claimed in claim 10. 

The Examiner cites Dictionary as disclosing "a scaling circuit adapted to scale the 
profile counts to prevent profile information overflow," as claimed in claim 23 and as 
essentially claimed in claim 10. A generic definition of scaling does not provide for a 
scaling circuit or scaling functionality in the claims. Further, there is no clearer example of 
hindsight reconstruction than the one provided by the Examiner here. Nothing in 
Krishnaswamv or Dictionary indicate any motivation, teaching or suggestion for such a 
combination. The Examiner in hindsight merely found a definition in Dictionary to satisfy 
his rejection. It is impermissible to use the claimed invention as an instruction manual or 
'template" in attempting to piece together isolated disclosures and teachings of the prior 
art so that the claimed invention is rendered obvious. In re Fritch, 972 F.2d 1260, 1266, 
23 USPQ2d 1780, 1783 (Fed. Cir. 1992). There is simply no other way to explain the 
disparate citation to Dictionary other than improper hindsight reasoning. 

The Examiner states, without support, that it <fc would have been obvious [to 
combine Krishnaswamv with Dictionary ] because one of ordinary skill in the art would be 
motivated to adjust the stored value to the hardware/equipment (register size limitations) 
(Computing, page 432)." It is entirely unclear to Appellants how the citation to page 432 
of Dictionary supports the Examiner's statement. It is further unclear where such 
reasoning originated, other than from the Examiner's own imagination. The statement by 
the Examiner is clearly conclusory and without merit. 

Because the combination Krishnaswamv and Dictionary neither teaches nor 
suggests each and every element of claims 23 and 10, it is respectfully asserted that no 
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prima facie case of obviousness has been made out. Accordingly, the rejection of claims 
3, 9-10, 23-30 and 32-39 should be reversed. 

(iv). The combination of Krishnaswamy and Dictionary fails to teach or 
suggest "a memory array adapted to store profile counts for events 
associated with execution of the computer program, said memory array 
being separate and distinct from the memory hierarchy so as to not 
perturb normal operations of the memory hierarchy," as claimed in claim 
23. 

The Examiner incorrectly contends that counters of the PMU 90 in Krishnaswamy 
teach or suggest the "memory array," as claimed by Appellants. Such an assertion is 
untenable and incorrect in the view of those skilled in the art. Although the Examiner is 
allowed a broad interpretation of the claim, the Examiner apparently forgets that the 
interpretation must be reasonable. No one skilled in the art would confuse the terms 
"memory" and "counters," as the Examiner contends. 

A "memory" (and "memory array") as used in the general terminology of computer 
architecture is distinguishable from a "counter." Common examples of memory include 
SRAM and DRAM. On the other hand, a "counter" is built with latches or flip flops, 
which are "clocked elements." It should further be noted that while memory arrays are 
addressed, counters cannot be addressed, in contradiction to the description of "memory" 
in the patent application and also as claimed in claim 33. 

Because the combination of Krishnaswamy and Dictionary neither teaches nor 
suggests each and every element of claim 23, it is respectfully asserted that no prima facie 
case of obviousness has been made out. Accordingly, the rejection of claims 23-30 and 
32-39 should be reversed. 
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D. Claims 40 and 42 stand rejected under 35 U.S.C. § 103(a) as being 
unpatentable over Krishnaswamv in view of Chang. 



(i) . The combination of Krishnaswamv and Chang fails to teach or 
suggest "selecting at least one of the plurality of event-specific profile 
counts for profiling the path of the computer program/' as claimed in 
claim 40, 

The Examiner applies the same rejection applied for "selecting at least one of the 
plurality of events for profiling," as claimed in claim 1, for "selecting at least one of the 
plurality of event-specific profile counts for profiling the path of the computer program" as 
claimed in claim 40. Appellants submit that the arguments provided in section (B)(i) 
above apply similarly to this section. In particular, that the counters in Krishnaswamv are 
"programmable" does not necessarily imply that a selection of the events is made. 

Because the combination of Krishnaswamv and Chang neither teaches nor suggests 
each and every element of claim 40, it is respectfully asserted that no prima facie case of 
obviousness has been made out. Accordingly, the rejection of claims 40-42 should be 
reversed. 

(ii) . The combination of Krishnaswamv and Chang fails to teach or 
suggest "if at least one of the selected event-specific profile counts has 
exceeded a predefined threshold, optimizing the portions of the computer 
program associated with the event-specific profile counts more 
aggressively than other portions of the computer program," as claimed in 
claim 40. 

Because the combination of Krishnaswamv and Chang does not teach or suggest 
"selecting at least one of the plurality of event-specific profile counts for profiling the path 
of the computer program," as showin in part (D)(i) above, it logically follows that the 
combination of Krishaswamv and Chang does not disclose "if at least one of the selected 
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event-specific profile counts has exceeded a predefined threshold, optimizing the portions 
of the computer program associated with the event-specific profile counts more 
aggressively than other portions of the computer program." 

Because the combination of Krishnaswamy and Chang neither teaches nor suggests 
each and every element of claim 40, it is respectfully asserted that no prima facie case of 
obviousness has been made out. Accordingly, the rejection of claims 40-42 should be 
reversed. 

(iii). No proper suggestion., teaching or motivation is provided for the 
combination of Krishnaswamy and Chang. 

In combining Krishnaswamy and Chang, the Examiner states, without support, that 
the combination "would have been obvious because one of ordinary skill in the art would 
be motivated to optimize frequently executed program paths primarily since they are 
executed more (page 1301, Introduction and pages 1306-1308)." The citation to Chang 
does -not provide any motivation or suggest to combine the optimizations of Chang with 
the PMU counters of Krishnaswamy . Again, as with section (C)(iii) above, the only 
plausible support for combining Krishnaswamy and Chang in such a manner is improper 
hindsight reasoning using the Appellants' disclosure. 

Because no proper motivation, teaching or suggstion is provided for the 
combination of Krishnaswamy and Chang , it is respectfully asserted that no prima facie 
case of obviousness has been made out. Accordingly, the rejection of claims 40-42 should 
be reversed. 
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(iv). The combination of Krishnaswamv and Chang fails to teach or suggest 
"storing, in a memory array, a plurality of event-specific profile counts, 
each associated with an event associated with the execution of a path of the 
computer program, the memory array being separate and distinct from 
the memory hierarchy so as to not perturb normal operations of the 
memory hierarchy," as claimed in claim 40. 

The Examiner incorrectly contends that counters of the PMU 90 in Krishnaswamv 
teach or suggest the "memory array," as claimed by Appellants. Such an assertion is 
untenable and incorrect in the view of those skilled in the art. Although the Examiner is 
allowed a broad interpretation of the claim, the Examiner apparently forgets that the 
interpretation must be reasonable. No one skilled in the art would confuse the terms 
"memory" and "counters," as the Examiner contends. 

A "memory" (and "memory array") as used in the general terminology of computer 
architecture is distinguishable from a "counter." Common examples of memory include 
SRAM and DRAM. On the other hand, a "counter" is built with latches or flip flops, 
which are "clocked elements." It should further be noted that while memory arrays are 
addressed, counters cannot be addressed, in contradiction to the description of "memory" 
in the patent application and also as claimed in claim 33. 

Because the combination of Krishnaswamv and Chang neither teaches nor suggests 
each and every element of claim 40, it is respectfully asserted that no prima facie case of 
obviousness has been made out. Accordingly, the rejection of claims 40-42 should be 
reversed. . 
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F. CONCLUSION 

The claimed invention is not disclosed or suggested by the teachings of the applied 
prior art references, either alone or in combination. Moreover, the Examiner has failed to 
establish a prima facie case of obviousness of the presently claimed method under 35 
U.S.C. § 103(a) over the combination of Krishnaswamv with various other references for 
at least the reasons noted above. Accordingly, it is respectfully requested that the Board 
reverse the rejection of claims 1, 3-16, 18-30 and 32-42 under 35 U.S.C. § 103(a). 



Respectfully submitted, 

By: // 
Koon Hon Wong 
Reg. No. 48,459 
Attorney for Appellants 



F. Chau & Associates, LLC 
130 Woodbury Road 
Woodbury, NY 11797 
TEL: (516) 692-8888 
FAX: (516)692-8889 
Attorneys for Appellants 
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Claims Appendix 

1 . A method for profiling computer program executions in a computer processing 
system having a processor and a memory hierarchy, comprising the steps of: 

executing a computer program; 

storing, in a memory array, profile counts for a plurality of events associated with 
the execution of the computer program, the memory array being separate and distinct from 
the memory hierarchy so as to not perturb normal operations of the memory hierarchy; 

selecting at least one of the plurality of events for profiling; 

updating the profile counts for only the selected events; 

assisting compilation and optimization of the computer program, based upon the 
selected profile counts stored in the memory array. 

2. (Cancelled). 

3. The method according to claim 1, wherein said storing and updating steps are 
performed asynchronously to prevent a decrease of an execution speed of the computer 
program. 

4. The method according to claim 1 , wherein said updating step is triggered by 
execution of the events. 

5. The method according to claim 1 , wherein said updating step is triggered by 
execution of instructions embedded into an instruction stream of the computer program. 

6. he method according to claim 1, further comprising the step of detecting 
whether a profile count has exceeded an adjustable predefined threshold. 

7. The method according to claim 1, further comprising the step of indicating when 
a profile count has exceeded an adjustable predefined threshold. 
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8. The method according to claim 7, wherein said indicating step comprises the 
step of raising an exception. 

9. The method according to claim 1, further comprising the steps of: 
accumulating profile updates; and 

dividing the accumulated profile updates by a threshold fraction. 

10. The method according to claim 1, further comprising the step of scaling the 
profile counts to prevent profile information overflow. 

1 1 . The method according to claim 1 , further comprising the step of identifying 
profile information corresponding to the profile counts using a profiling event identifier. 

12. The method according to claim 1 1, further comprising the step of addressing 
the memory array, using the profiling event identifier. 

13. The method according to claim 1, further comprising the steps of: 
generating the profile counts using profile counters associated with the events; and 
maintaining the profile counters in a set-associate manner. 

14. The method according to claim 13, further comprising the step of selecting a 
profile counter to be evicted from the memory array based upon a predefined replacement, 
when a number of profiling events assigned to an associative class of events is exceeded. 

15. The method according to claim 14, wherein the replacement strategy is based 
upon one of least-recently-used and first-in-first-out. 

16. The method according to claim 1, further comprising the step of supporting 
read operations from the memory array in an off-line optimization of the program. 

17. (Cancelled). 
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18. The method according to claim 1 5 wherein said assisting step is performed 
during at least one of dynamic binary translation and dynamic optimization of the 
computer program. 

19. The method according to claim 18, wherein the dynamic binary translation and 
dynamic optimization of the computer program results in translated and optimized code, 
respectively, the translated and optimized code comprising instructions groups which pass 
control therebetween. 

20. The method according to claim 19, further comprising the step of identifying 
frequently executed paths of the computer program, by instrumenting exits from the 
instruction groups with a profiling instruction that indicates a unique group exit identifier. 

21. The method according to claim 19, further comprising the step of extending the 
instruction groups along a frequently executed path. 

22. The method according to claim 1 , wherein the memory hierarchy includes data 
and instruction caches, and the memory array is separate and distinct from the memory 
hierarchy so as to not perturb normal operations of the data and instruction caches. 

23. An apparatus for profiling computer program executions in a computer 
processing system having a processor and a memory hierarchy, the apparatus comprising: 

a memory array adapted to store profile counts for events associated with execution 
of the computer program, said memory array being separate and distinct from the memory 
hierarchy so as to not perturb normal operations of the memory hierarchy; 

a controller adapted to select the events for profiling and to update the profile 
counts of the selected events stored in said memory array; and 

a scaling circuit adapted to scale the profile counts to prevent profile information 
overflow; 
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wherein the computer processing system assists compilation of the computer 
program, based upon the profile counts stored in the memory array. 

24. The apparatus according to claim 23, wherein said memory array and said 
controller are adapted to asynchronously store and update the profile counts, respectively, 
to prevent a decrease of an execution speed of the computer program. 

25. The apparatus according to claim 23, wherein said controller is adapted to 
update the profile counts as the events are executed. 

26. The apparatus according to claim 23, wherein said controller is adapted to 
update the profile counts based upon instructions embedded into an instruction stream of 
the computer program. 

27. The apparatus according to claim 23, further comprising a comparator circuit 
adapted to detect whether a profile count has exceeded an adjustable predefined threshold. 

28. The apparatus according to claim 23, further comprising an indicating circuit 
for indicating when a profile count has exceeded an adjustable predefined threshold. 

29. The apparatus according to claim 28, wherein said indicating circuit is adapted 
to raise an exception when the profile count has exceeded the adjustable predefined 
threshold. 

30. The apparatus according to claim 23, further comprising: 

an accumulation circuit adapted to accumulate the updated profile counts; and 
a dividing circuit adapted to divide an accumulated value of the updated 
accumulated profile counts by a threshold fraction. 

31. (Cancelled). 
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32. The apparatus according to claim 23, wherein profile information 
corresponding to the profile counts is identified using a profiling event identifier. 

33. The apparatus according to claim 32, wherein the memory array is addressed 
using the profiling event identifier. 

34. The apparatus according to claim 23, further comprising profile counters for 
generating the profile counts, said profile counters being associated with an event in a set- 
associate manner. 

35. The apparatus according to claim 34, further comprising a replacement circuit 
adapted to select a profile counter to be evicted from the memory array based on a 
predefined replacement strategy, when a number of profiling events assigned to an 
associative class is exceeded. 

36. The apparatus according to claim 35, wherein the predefined replacement 
strategy is based upon one of least-recently-used and first-in-first-out. 

37. The apparatus according to claim 23, wherein the memory hierarchy includes 
data and instruction caches, and said memory array is separate and distinct from the 
memory hierarchy so as to not perturb normal operations of the data and instruction 
caches. 

38. The method according to claim 1, wherein said method is implemented by a 
program storage device readable by machine, tangibly embodying a program of 
instructions executable by the machine to perform said method steps. 

39. The apparatus according to claim 23, further comprising wherein the computer 
processing system assists optimization of the computer program, based upon the profile 
counts stored in the memory array. 
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40. A method for profiling computer program executions in a computer processing 
system having a processor and a memory hierarchy, comprising the steps of: 

executing a computer program; 

storing, in a memory array, a plurality of event-specific profile counts, each 
associated with an event associated with the execution of a path of the computer program, 
the memory array being separate and distinct from the memory hierarchy so as to not 
perturb normal operations of the memory hierarchy; 

selecting at least one of the plurality of event-specific profile counts for profiling 
the path of the computer program; and 

if at least one of the selected event-specific profile counts has exceeded a 
predefined threshold, optimizing the portions of the computer program associated with the 
event-specific profile counts more aggressively than other portions of the computer 
program. 

41 . The method according to claim 40, further comprising the step of optimizing of 
the portions of the computer program during at least one of static and dynamic 
compilation. 

42. The method according to claim 40, wherein the memory array is arranged as a 
two-way set associative array. 
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Evidence Appendix 

None 

Related Procedings Appendix 

None 



